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Abstract 

Python implementation of selected weighted graph algorithms is presented. The minimal graph 
interface is defined together with several classes implementing this interface. Graph nodes can be 
any hashable Python objects. Directed edges are instances of the Edge class. Graphs are instances 
of the Graph class. It is based on the adjacency-list representation, but with fast lookup of nodes 
and neighbors (dict-of-dict structure). Other implementations of this class are also possible. 

In this work, many algorithms are implemented using a unified approach. There are separate 
classes and modules devoted to different algorithms. Three algorithms for hnding a minimum span¬ 
ning tree are implemented: the Boruvka’s algorithm, the Prim’s algorithm (three implementations), 
and the Kruskal’s algorithm. Three algorithms for solving the single-source shortest path problem 
are implemented: the dag shortest path algorithm, the Bellman-Ford algorithm, and the Dijkstra’s 
algorithm (two implementations). Two algorithms for solving all-pairs shortest path problem are 
implemented: the Floyd-War shall algorithm and the Johnson’s algorithm. 

All algorithms were tested by means of the unittest module, the Python unit testing framework. 
Additional computer experiments were done in order to compare real and theoretical computational 
complexity. The source code is available from the public GitHub repository. 


* Corresponding author: andrzej.kapanowski@uj.edu.pl 
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I. INTRODUCTION 


Algorithms are at the heart of computer science. They are expressed as a finite sequence 
of operations with clearly defined input and output. Algorithms can be expressed by natu¬ 
ral languages, pseudocode, flowcharts or programming languages. Pseudocode is often used 
in textbooks and scientific publications because it is compact and augmented with natural 
language description. However, in depth understanding of algorithms is almost impossible 
without computer experiments where algorithms are implemented by means of a program¬ 
ming language. 

Our aim is to show that the Python programming language {l| can be used to implement 
algorithms with simplicity and elegance. On the other hand, the code is almost as readable 
as pseudocode because the Python syntax is very clear. That is why Python has become 
one of the most popular teaching languages in colleges and universities j^, and it is used 
in scientific research j^. Python implementation of some algorithms from computational 
group theory was shown in Ref. jd]. In this paper we are interested in graph theory and 
weighted graph algorithms. Other algorithms will be discussed elsewhere. The source code 
of our programs is available from the public GitHub repository Q. We strongly support a 
movement toward open source scientific software 6|. 

Graphs can be used to model many types of relations and processes in physical, biological, 
social, and information systems Q. Many practical problems can be represented by graphs 
and this can be the first step to finding the solution of the problem. Sometimes the solution 
can be known for a long time because graph theory was born in 1736. In this year Leonard 
Euler solved the famous Seven Bridges of Konigsberg (Krolewiec in Polish) problem. 

We examined several Python graph packages from The Python Package Index jsj in 


order to checked different approaches. Some 
languages were checked briefly B. H, 


iii, ii^, 


grap 


1 libraries written in other programming 
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d’ackage NetworkX 1.9 hy Aric A. Hagberg, Daniel A. Schult, and Pieter J. Swart 
I^ . A library for the creation, manipulation, and study of the structure, dynamics, 
and functions of complex networks. Basic classes are Graph, DiGraph, MultiGraph, 
and MultiDiGraph. All graph classes allow any hashable object as a node. Arbitrary 
edge attributes can be associated with an edge. The dictionary of dictionaries data 
structure is used to store graphs. NetworkX is integrated into Sage. 
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Package python-igraph 0.7.0 1^. igraph is the network analysis package with versions 
for R, Python, and C/C++. 

Package python-graph 1.8.2 by Pedro Matiello A library for working with graphs 
in Python. Edges are standard Python tuples, weights or labels are kept separately. 
There are different classes for directed graphs, undirected graphs, and hypergraphs. 


Package graph O .4 by Robert Dick and Kosta Gaitanis 
graph data structures and algorithms. 


18| . Directed and undirected 


None of the available implementations satisfy our needs. Usually high computational 
efficiency leads to the unreadable code. On the other hand, C/C++ syntax or Java syntax 
are not very close to the pseudocode used for learning algorithms. That is why we develop 
and advocate our approach. We note that presented Python implementations may be not 
as fast as C/C++ or Java counterparts but they scale with the input size according to the 
theory. The presented algorithms are well known and that is why we used references to 
many Wikipedia pages. 

The paper is organized as follows. In Section |TT] basic dehnitions from graph theory are 
given. In Section HTTI the graph interface is presented. In Sections lIVlIVl and|VT]the following 
algorithms are shown; for Ending a minimum spanning tree, for solving the single-source 
shortest path problem, for solving all-pair shortest path problem. Conclusions are contained 
in Section |vni 


II. DEFINITIONS 


Definitions of graphs vary and that is why we will present our choice which is very common 
jl9|. We will not consider multigraphs with loops and multiple edges. 


A. Graphs 

A (simple) graph is an ordered pair G = (V, E), where U is a finite set of nodes (vertices, 
points) and U is a finite set of edges (lines, arcs, links). An edge is an ordered pair of 
different nodes from V, (s, t), where s is the source node and t is the target node. This is a 
directed edge and in that case G is called a directed graph. 
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An edge can be defined as a 2-elenient subset of V, {s,f} = and then it is called 

an undirected edge. The nodes s and t are called the ends of the edge (endpoints) and 
they are adjacent to one another. The edge connects or joins the two endpoints. A graph 
with undirected edges is called an undirected graph. In our approach, an undirected edge 
corresponds to the set of two directed edges {{s,t), {t, s)} and the representative is usually 
(s, t) with s < t. 

The order of a graph G = (Id, E) is the number of nodes |Id|. The degree of a node in an 
undirected graph is the number of edges that connect to it. 

A graph G' = (1^', E') is a subgraph of a graph G = (Id, E) if Id' is a subset of Id and E' 
is a subset of E. 


B. Graphs with weights 

A graph structure can be extended by assigning a number (weight) w{s,t) to each edge 
(s,f) of the graph. Weights can represent lengths, costs or capacities. In that case a graph 
is a weighted graph. 


C. Paths and cycles 

A path P from s to t in a graph G = (V, E) is a sequence of nodes from Id, (uq, Ui,..., Vn), 
where Vq = s, Vn = t, and {vi_i,Vi) {i = 1,... ,n) are edges from E. The length of the path 
P is n. A simple path is a path with distinct nodes. The weight (cost or length) of the path 
in a weighted graph is the sum of the weights of the corresponding edges. 

A cycle is a path C starting and ending at the same node, Uq = Vn. A simple cycle is 
a cycle with no repetitions of nodes allowed, other than the repetition of the starting and 
ending node. 

A directed path (cycle) is a path (cycle) where the corresponding edges are directed. In 
our implementation, a path is a list of nodes. A graph is connected if for every pair of nodes 
s and t, there is a path from s to t. 
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D. Trees 


A (free) tree is a connected undirected graph T with no cycles. A forest is a disjoint union 
of trees. A spanning tree of a connected, undirected graph G is a tree T that includes all 


nodes of G and is a subgraph of G 


20| . Spanning trees are important because they construct 


a sparse subgraph that tells a lot about the original graph. Also some hard problems can 
be solved approximately by using spanning trees (e.g. traveling salesman problem). 

A rooted tree is a tree T where one node is designated the root. In that case, the edges 
can be oriented towards or away from the root. In our implementation, a rooted tree is kept 
as a dictionary, where keys are nodes and values are parent nodes. The parent node of the 
root is None. A forest of rooted trees can be kept in a dictionary with many roots. 

A shortest-path tree rooted at node s is a spanning tree T of G, such that the path 
distance from root s to any other node f in T is the shortest path distance from s to t in G. 

III. INTERFACE FOR GRAPHS 


According to the dehnitions from Section [TTl graphs are composed of nodes and edges. In 
our implementation nodes can be any hashable object that can be sorted. Usually they are 
integer or string. 

Edges are instances of the Edge class (edges module) and they are directed, hashable, and 
comparable. Any edge has the starting node (edge.source), the ending node (edge.target), 
and the weight (edge.weight). The default weight is one. The edge with the opposite 


direction is equa’ 


planar graphs 


2l|. 


to edge. This is very useful for combinatorial maps used to represent 


Simple graphs (directed and undirected) are instances of the Graph class (graphs module). 
Multigraphs (directed and undirected) are instances of the MultiGraph class (multigraphs 
module) and they will not be discussed here. Let us show some properties of graphs that 
are listed in Table [11 There are methods to report some numbers (nodes, edges, degrees). 
There are iterators over nodes and edges. There are also some logical functions. 

»> from edges import Edge 
»> from graphs import Graph 
»> G = Graph(n=3, directed=False) 
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# n is for compatibility with other implementations . 

>» G. is_directed () 

F alse 

»> G. add_edge (Edge (’A’ , ’B’, 5)) 

»> G.add_edge(Edge(’A’ , ’G’, 7)) 

# Nodes are added by default. 

»> print G.v(), G.e() # numbers of nodes and edges 

3 2 


»> list(G.iternodes()) 

[’A’, ’G’, ’B’] 

»> sorted ((G. degree (v) for 

| 2 , 1 , 1 | 

»> list(v for V in G. iterno 
[’G’, ’B’] 

»> sum ( edge . weight for edge 
12 

# Typical usage of an algorii 
»> from foo import Foo 
»> algorithm = Foo(G) 

»> algorithm . run () 

»> print algorithm . result 


# random order of nodes 

in G. iternodes ()) , reverse=True) 

# the degree sequence 

es() if G. degree (v) 1) 

# leafs 

in G. iteredges ()) 
ff the graph (tree) weight 
im Foo from the module foo. 

# initialization 
calculations 

ff results can be more... 


Note that in the case of undirected graphs, edge and “edge are two representatives of 
the same undirected edge. The method iteredges returns a representative with the ordering 
edge.source < edge.target. 

Graph algorithms are implemented using a unihed approach. There is a separate class 
for every algorithm. Different implementations of the same algorithm are grouped in one 

module. In the_init_method, main variables and data structures are initialized. The 

name space of a class instance is used to access the variables and that is why interfaces of 
the auxiliary methods can be very short. 
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TABLE 1. Interface for graphs; G is a graph, s and t are nodes. 


Method name 

Short description 

Graph(n) 

return an empty undirected graph 

Graph(n, directed= 

:True) return an empty directed graph 

G.is directed() 

return True if G is a directed graph 

G.v() 

return the number of nodes 

G.eQ 

return the number of edges 

G.add node(s) 

add s to G 

G.del node(s) 

remove s form G 

G.has_node(s) 

return True if s is in G 

G. add _ edge (edge) 

add a new edge to G 

G.del_edge(edge) 

remove the edge form G 

G. has _edge (edge) 

return True if the edge is in G 

G. weight (edge) 

return the edge weight or zero 

G.iternodes() 

generate nodes on demand 

G.iteredges() 

generate edges on demand 

G. iterout edges (s) 

generate outedges on demand 

G.iterinedges(s) 

generate inedges on demand 

G.degree(s) 

return the degree of s (G undirected) 

G. indegree (s) 

return the indegree of s 

G.outdegree(s) 

return the outdegree of s 


IV. MINIMUM SPANNING TREE 


Let us assume that G = (V, E) is a connected, undirected, weighted graph, and T is a 
spanning tree of G. We can assign a weight to the spanning tree T by computing the sum 
of the weights of the edges in T. A minimum spanning tree (MST) is a spanning tree with 


the weight less than or equal to the weight of every other spanning tree 


22| . In general. 


MST is not unique. There is only one MST if each edge has a distinct weight (a proof by 
contradiction). 
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Minimum spanning trees have many practical applications: the design of networks, tax¬ 


onomy, cluster analysis, and others 


22|. We would like to present three classical algorithms 


for finding MST that run in polynomial time. 


A. Boriivka’s algorithm 


The Boruvka’s algorithm works for a connected graph whose edges have distinct weights 
what implies the unique MST. In the beginning, the cheapest edge from each node to another 
in the graph is found, without regard to already added edges. Then joining these groupings 
continues in this way until MST is completed 2^. Components of MST are tracked using 


a disjoint-set data structure. The algorithm runs in 0{E\ogV) time. 

Our implementation of the Boruvka’s algorithm works also for disconnected graphs. In 
that case a forest of minimum spanning trees is created. What is more, repeated edge 
weights are allowed because our edge comparison use also nodes, when weights are equal. 


from edges import Edge 

from unionfind import UnionFind 


class BoruvkaMST : 

"""Boruvka’s algorithm for finding MST. """ 

def _init_ (self , graph); 

"""The algorithm initialization.""" 
self.graph = graph 

self.mst = graph. _class_ (graph. v()) # MST as a graph 

self.uf = UnionFind() 

def run(self); 

""" Executable pseudocode. " "" 
for node in self . graph . iternodes (): 
self.uf.create(node) 

forest = set (node for node in se If . graph . iternodes ()) 
dummy_edge = Edge (None, None, f loat (" inf " )) 
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new_len = len( forest) 
old_len = new_len + 1 
while old_len > new_len; 
old_len = new_len 

min_edges = diet (((node, dummy_edge) 
for node in forest)) 

# Finding the cheapest edges. 

for edge in self.graph.iteredges(): # 0(E) time 
source = s e 1 f . uf . find ( edge . source ) 
target = s e 1 f . uf . find ( edge . t arget) 
if source != target : # different components 

if edge < niin_edges [ source ] ; 

niin_edges [ source ] = edge 
if edge < niin_edges [ target ] ; 
niin_edges [ target ] = edge 

# Connecting components, total time is 0(V). 
forest = set() 

for edge in niin_edges . itervalues (): 

if edge is duniniy_edge: # a disconnected graph 

continue 

source = s e 1 f . uf . find ( edge . source ) 
target = s e 1 f . uf . find ( edge . t arget) 
if source != target : # different components 

s e 1 f . uf . union ( source , target) 
forest .add(source) 
self, rust . add_edge ( edge ) 

# Remove duplicates , total time is 0(V). 

forest = set ( s e 1 f . uf . find ( node) for node in forest) 
new_len = leu (forest) 

if new_len 1; # a connected graph 

break 
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We note that the Boruvka’s algorithm is well suited for parallel computation. 


B. Prim’s algorithm 

The Prim’s algorithm has the property that the edges in growing T always form a single 
tree. The weights from G can be negative We begin with some node s from G and s 
is added to the empty T. Then, in each iteration, we choose a minimum-weight edge (s,f), 
joining s inside T to t outside T. Then the minimum-weight edge is added to T. This 
process is repeated until MST is formed. 

The performance of Prim’s algorithm depends on how the priority queue is implemented. 
In the case of the first implementation a binary heap is used and it takes 0{E\ogV) time. 

from edges import Edge 

from Queue import PriorityQueue 


class PrimMST: 

"""Prim’s algorithm for finding MST. """ 

def_init_ (self , graph); 

"""The algorithm initialization.""" 
self.graph = graph 

s e 1 f . dist ance = dict((node, f loat (" inf ")) 
for node in se 1 f . graph . iternodes ()) 
self.parent = diet((node. None) 

for node in s e 1 f . graph . it er nodes ()) # MST as a diet 
s e 1 f . in_queue = diet ((node. True) 

for node in s e 1 f . graph . it er nodes ()) 
self.pq = PriorityQueue () 

def run (self , source=None ) : 

"""Executable pseudocode. """ 

if source is None: # get first random node 

source = self . graph . iternodes (). next () 
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self.source = source 

se 1 f . distance [ source ] = 0 

for node in self . graph . iternodes (): 

s e 1 f . pq . put (( s e 1 f . dist ance [ node ] , node )) 
while not se 1 f . pq . empty (): 

node = se1f.pq.get() 
if self, in _ queue[node]; 

se 1 f . in_queue [ node ] = False 
else ; 

continue 

for edge in self . graph . iteroutedges (node ): 
if ( se 1 f . in_queue [ edge . target ] and 

edge . weight < se 1 f . dist ance [ edge . t arget ]) ; 

s e 1 f . dist ance [ edge . t arget ] = edge . weight 
se1f.parent[edge.target] = edge.source 
s e 1 f . pq . put (( edge . weight , edge.target)) 

The second implementation is better for dense graphs (|i?| ~ |h^P), where the adjacency- 
matrix representation of graphs is often used. For-loop is executed |y| times, hnding the 
minimum takes 0{V). Therefore, the total time is 0(V^^). 

class PrimMatrixMST ; 

"""Prim’s algorithm for finding MST in 0(V**2) time. """ 

def_init _(self , graph); 

"""The algorithm initialization.""" 
self.graph = graph 

s e 1 f . dist ance = dict((node, float (" inf ")) 
for node in se 1 f . graph . iternodes ()) 
self, parent = diet ((node. None) 

for node in s e 1 f . graph . it er nodes ()) # MST as a diet 
s e 1 f . in_queue = diet ((node. True) 

for node in s e 1 f . graph . it er nodes ()) 
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// // // 


def run (self , source=None ) : 

"""Executable pseudocode. 
if source is None: # get first random node 

source = self . graph . iternodes (). next () 
self.source = source 
se 1 f . distance [ source ] = 0 

for step in xrange ( se 1 f . graph . v ()): # /F/ times 
# Find min node in the graph, 0(V) time. 
node = min((node for node in self . graph . iternodes () 
if se If . in_queue [ node ]) , key=s e 1 f . dist ance . get) 
s e 1 f . in_queue [ node ] = False 

for edge in self . graph . iteroutedges (node ): # 0(V) time 
if ( self . in_queue [ edge . target ] and 

edge . weight < se If . distance [ edge . target ]) ; 

s e 1 f . dist ance [ edge . t arget ] = edge . weight 
se1f.parent[edge.target] = edge.source 


C. Kruskal’s algorithm 


The Kruskal’s algorithm builds the MST in forest 


25| . 2d |. In the beginning, each node 


is in its own tree in forest. Then all edges are scanned in increasing weight order. If an edge 
connects two different trees, then the edge is added to the MST and the trees are merged. 
If an edge connects two nodes in the same tree, then the edge is discarded. 

The presented implementation uses a priority queue in order to sort edges by weights. 
Components of the MST are tracked using a disjoint-set data structure (the UnionFind 
class). The total time is 0{E\ogV). 


from unionfind import UnionFind 
from Queue import PriorityQueue 


class KruskalMST ; 
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"""Kruskal’s algorithm for finding MST. """ 


def_init_ (self , graph); 

"""The algorithm initialization.""" 
self.graph = graph 

self.mst = graph._class _(graph. v()) # MST as a graph 

self.uf = UnionFind() 
self.pq = Pr iorityQueue () 

def run(self): 

""" Executable pseudocode. """ 

for node in self.graph.iternodes(): 0(V) time 

self.uf.create(node) 

for edge in self . graph . iteredges (): #0(E*log(V)) time 
s e 1 f . pq . put (( edge . weight , edge)) 
while not se 1 f . pq . empty (): jEj steps 

edge = s e 1 f . pq . get () # 0(log(V)) time 

if ( se 1 f . uf . find ( edge . source ) ! = 
self.uf.find( edge .target )): 

self.uf.union( edge . source , edge . target) 
self, mst . add_edge ( edge ) 

The Kruskal’s algorithm is regarded as best when the edges can be sorted fast or are 
already sorted. 


V. SINGLE-SOURCE SHORTEST PATH PROBLEM 


Let us assume that G = (V, E) is a weighted directed graph. The shortest path problem is 
he problem of finding a path between two nodes in G such that the path weight is minimized 


27| . The negative edge weights are allowed but the negative weight cycles are forbidden (in 


that case there is no shortest path). 

The Dijkstra’s algorithm and the Bellman-Ford are based on the principle of relaxation. 
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Approximate distances (overestimates) are gradually replaced by more accurate values until 
the correct distances are reached. However, the details of the relaxation process differ. Many 
additional algorithms may be found in Ref. 281. 


A. Bellman-Ford algorithm 


The Bellman-Ford algorithm computes shortest paths from a single source to all of the 
other nodes in a weighted graph G in which some of the edge weights are negative 29|. The 


algorithm relaxes all the edges and it does this |R| — 1 times. At the last stage, negative 
cycles detection is performed and their existence is reported. The algorithm runs in 0{V -E) 
time. 

class BellmanFord : 

"""The Bellman—Ford algorithm for the shortest path problem. """ 


def _init_ (self , graph); 

"""The algorithm initialization.""" 
if not graph . is_directed (): 

raise ValueError (" graph is not directed") 
self.graph = graph 

s e 1 f . dist ance = dict(((node, float (" inf ")) 
for node in se 1 f . graph . iternodes ())) 

# Shortest path tree as a dictionary . 
self.parent = diet (((node. None) 

for node in s e 1 f . graph . it er nodes ())) 

def run(self, source): 

""" Executable pseudocode. """ 
self.source = source 
s e 1 f . dist ance [ source ] = 0 

for step in xrange ( se 1 f . graph . v ()—1): # l^l~i times 

for edge in self.graph.iteredges(): # 0(E) time 

self. _relax ( edge ) 
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# Check for negative cycles. 

for edge in self.graph.iteredges(): # 0(E) time 

if (self. distance[edge.target] > self.distance[edge.source 
+ edge . weight ): 

raise ValueError (" negative cycle") 

def _relax(self, edge): 

"""Edge relaxation . """ 

alt = self . distance [ edge . source ] + edge . weight 
if se If . distance [ edge . target ] > alt; 
s e 1 f . dist ance [ edge . t arget ] = alt 
se1f.parent[edge.target] = edge.source 
return True 
return False 

def path(self, target): 

"""Construct a path from source to target. """ 
if se If .source target: 

return [se1f.source] 
elif se1f.parent[target] is None: 

raise ValueError (" no path to target") 
else ; 

return se1f . path(se1f.parent[target]) + [target] 


B. Dijkstra’s algorithm 


The Dijkstra’s algorithm solves the single-source shortest path prob 


with non-negative edge weights, producing a shortest path tree T 




em for a graph G 


31|. It is a greedy 


algorithm that starts at the source node, then it grows T and spans all nodes reachable from 
the source. Nodes are added to T in order of distance. The relaxation process is performed 
on outgoing edges of minimum-weight nodes. The total time is 0{E\ogV). 
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from Queue import PriorityQueue 


class Dijkstra : 

""" Dijkstra ’s algorithm for the shortest path problem. """ 

def_init_(self , graph); 

"""The algorithm initialization.""" 
if not graph . is_directed (); 

raise ValueError (" graph is not directed") 
self.graph = graph 

s e 1 f . dist ance = dict((node, float (" inf ")) 
for node in se 1 f . graph . iternodes ()) 
self.parent = diet ((node, None) 

for node in s e 1 f . graph . it er nodes ()) 
s e 1 f . in_queue = diet ((node, True) 

for node in s e 1 f . graph . it er nodes ()) 
self.pq = PriorityQueue () 

def run (self , source ): 

"""Executable pseudocode. """ 

self.source = source 

s e 1 f . dist ance [ source ] = 0 

for node in self . graph . iternodes (): 

s e 1 f . pq . put (( s e 1 f . dist ance [ node ] , node )) 
while not s e 1 f . pq . empty (): 

_, node = se1f.pq.get() 
if self, in_queue[node]; 

se 1 f . in_queue [ node ] = False 
else ; 

continue 

for edge in self . graph . iteroutedges (node ): 
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if se 1 f . in_queue [ edge . t arget ] and se 1 f . _relax ( edge ): 
self. pq.put((self.distance[edge.target] , 
edge.target)) 

The second implementation is better for dense graphs with the adjacency-matrix repre¬ 
sentation, the total time is 0{V‘^). 

class DijkstraMatrix ; 

" "" Dijkstra ’s algorithm with 0(V**2) time. """ 

def_ init _(self , graph); 

"""The algorithm initialization.""" 
if not graph . is_directed (): 

raise ValueError (" graph is not directed") 
self.graph = graph 

s e 1 f . dist ance = dict((node, float (" inf ")) 
for node in se 1 f . graph . iternodes ()) 
self.parent = diet((node, None) 

for node in s e 1 f . graph . it er nodes ()) 
s e 1 f . in_queue = diet ((node. True) 

for node in s e 1 f . graph . it er nodes ()) 

def run (self , source ): 

""" Executable pseudocode. """ 
self.source = source 
s e 1 f . dist ance [ source ] = 0 

for step in xrange ( se 1 f . graph . v ()): # /E/ times 

# Find min node, 0(V) time. 

node = min (( node for node in self . graph . iternodes () 
if self, in _ queue [ node ]) , key=s e If.distance.get) 
se 1 f . in_queue [ node ] = False 

for edge in self . graph . iteroutedges (node ): # 0(V) time 
if self, in_queue[edge.target]; 
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self. _relax ( edge ) 


Le us note that a time oi 0{E + V log V) is best possible for Dijkstra’s algorithm, if edge 


weights are real numbers and only binary comparisons are used. This bond is attainable 



using Fibonacci heaps, relaxed heaps or Vheaps (32|. Researchers are also working on 


adaptive algorithms which proht from graph easiness (small density, not many cycles). 

C. DAG Shortest Path 

Shortest paths in DAG are always defined because there are no cycles. The algorithm 



runs m. 0{y + E) time because topological sorting of graph nodes is conducted jl9| 


from topsort import TopologicalSort 

class DAGShortestPath: 

"""The shortest path problem for DAG. """ 

def _init_ (self , graph); 

"""The algorithm initialization.""" 
if not graph . is_directed (): 

raise ValueError (" graph is not directed") 
self.graph = graph 

s e 1 f . dist ance = dict((node, f loat (" inf ")) 
for node in se 1 f . graph . iternodes ()) 
self.parent = diet((node. None) 

for node in s e 1 f . graph . it er nodes ()) 

def run(self, source): 

""" Executable pseudocode. " " " 

self.source = source 

s e 1 f . dist ance [ source ] = 0 

algorithm = TopologicalSort ( self . graph) 

algorithm . run () 
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for source in algorithm . sorted_nodes : 

for edge in self . graph . iteroutedges ( source ): 
self. _relax ( edge ) 


VI. ALL-PAIRS SHORTEST PATH PROBLEM 


Let us assume that G = (P, E) is a weighted directed graph. The all-pairs shortest 


path problem is the problem of hnding shortest paths between every pair of nodes 


27|. 


Two algorithms solving this problem are shown: the Floyd-Warshall algorithm and the 
Johnson’s algorithm. There is the third algorithm in our repository, which is based on 
matrix multiplication (the allpairs .py module) A basic version has a running time of 
0(1/^) (the SlowAllPairs class), but it is improved to O(P^logP) (the FasterAllPairs class). 


A. Floyd-Warshall algorithm 


The Floyd-Warshall algorithm computes shortest paths in a weighted graph with positive 


or negative edge weights, but with no negative cycles 


33|. The algorithm uses a method of 


dynamic programming. The shortest path from s to t without intermediate nodes has the 
length w{s, t). The shortest paths estimates are incrementally improved using a growing set 
of intermediate nodes. 0{V^) comparisons are needed to solve the problem. Three nested 
for-loops are the heart of the algorithm. 

Our implementation allows the reconstruction of the path between any two endpoint 
nodes. The shortest-path tree for each node is calculated and O(V^) memory is used. The 
diagonal of the path matrix is inspected and the presence of negative numbers indicates 
negative cycles. 

class FloydWarshallPaths : 

"""The Floyd—Warshall algorithm with path reconstruction. """ 


def _init_ (self , graph); 

"""The algorithm initialization . 
if not graph . is_directed (): 
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raise ValueError (" graph is not directed") 
self.graph = graph 
s e 1 f . dist ance = dict() 
self.parent = dict() 

for source in se 1 f . graph . iternodes (): 
s e 1 f . dist ance [ source ] = dict() 
s e 1 f . parent [ sour ce ] = dict() 
for target in se 1 f . graph . iternodes (); 

s e 1 f . dist ance [ source ][ t arget ] = float("inf") 
s e 1 f . parent [ sour ce ][ t arget ] = None 
s e 1 f . dist ance [ source ][ source ] = 0 
for edge in self . graph . iteredges (): 

s e 1 f . dist ance [ edge . source ][ edge . t arget ] = edge . weight 
se1f . parent[edge.source][edge.target] = edge.source 

def run(self); 

""" Executable pseudocode. """ 

for node in self . graph . iternodes () : 

for source in se 1 f . graph . iternodes (); 

for target in se If . graph . iternodes (): 

alt = se 1 f . distance [ source ][ node ] + \ 
self.distance [node][target] 
if alt < self . distance [ source ][ target ] ; 
s e 1 f . dist ance [ source ][ t arget ] = alt 
s e 1 f . parent [ sour ce ][ t arget ] = \ 
self.parent[node][target] 
if any(self.distance[node][node] < 0 
for node in self . graph . iternodes ()): 

raise ValueError (" negative cycle detected") 

def path(self, source, target); 
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"""Path reconstruction . """ 
if source target: 
return [source] 

elif self.parent[source][target] is None: 

raise ValueError (" no path to target") 
else : 

return se If.path(source , se1f.parent[target]) + [target] 


B. Johnson’s algorithm 

The Johnson’s algorithm finds the shortest paths between all pairs of nodes in a sparse 
directed graph. The algorithm uses the technique of reweighting. It works by using the 
Bellman-Ford algorithm to compute a transformation of the input graph that removes all 
negative weights, allowing Dijkstra’s algorithm to be used on the transformed graph H- 
The time complexity of our implementation is OiVElogV), because the binary min-heap 
is used in the Dijkstra’s algorithm. It is asymptotically faster than the Floyd-Warshall 
algorithm if the graph is sparse. 

from edges import Fdge 

from bellmanford import BellmanFord 

from dijkstra import Dijkstra 

class Johnson: 

"""The Johnson algorithm for the shortest path problem. """ 

def _init_ (self , graph): 

"""The algorithm initialization.""" 
if not graph . is_directed (): 

raise ValueError (" graph is not directed") 
self.graph = graph 

def run(self): 
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"""Executable pseudocode. """ 

s e 1 f . new_graph = self, graph._class_( 

s e 1 f . graph . V () + 1, directed=True) 
for node in self.graph.iternodes(): # 0(V) time 

self. new_graph . add_node ( node) 
for edge in self.graph.iteredges(): # 0(E) time 

self. new_graph . add_edge ( edge ) 
s e 1 f . new_node = s e 1 f . graph . v () 
self. new_graph . add_node (self. new_node) 
for node in self.graph.iternodes(): # 0(V) time 

s e 1 f . new_graph . add_edge (Edge ( s e 1 f . new_node , node, 0)) 
self.bf = BellmanFord ( se If . new_graph) 

# If this step detects a negative cycle, 
the algorithm is terminated. 

se 1 f . bf . run ( se 1 f . new_node) # 0(V*E) time 
Edges are reweighted. 

for edge in list (self. new_graph .iteredges()): # 0(E) time 

edge . weight = (edge, weight 

+ self.bf.distance[edge.source] 

— se1f.bf. distance[edge.target]) 
self. new_graph . del_edge ( edge ) 
self. new_graph . add_edge ( edge ) 

# Remove new_node with edges. 

self. new_graph . del_node (self. new_node) 

# Weights are now modified! 
s e 1 f . dist ance = dict() 

for source in self . graph . iternodes () : 
s e 1 f . dist ance [ source ] = dict() 
algorithm = Dijkstra ( self . new_graph) 
algorithm .run (source) 

for target in se 1 f . graph . iternodes () : 
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se 1 f . distance [ source ][ target ] = ( 
algorithm . distance [target] 

— self . bf . distance [ source ] 

+ self.bf.distance[target]) 


VII. CONCLUSIONS 

In this paper, we presented Python implementation of several weighted graph algorithms. 
The algorithms are represented by classes where graph objects are processed via proposed 
graph interface. The presented implementation is unique in several ways. The source code 
is readable like a pseudocode from textbooks or scientific articles. On the other hand, the 
code can be executed with efficiency established by the corresponding theory. Python’s 
class mechanism adds classes with minimum of new syntax and it is easy to create desired 
data structures (e.g., an edge, a graph, a union-find data structure) or to use objects from 
standard modules (e.g., queues, stacks). 

The source code is available from the public GitHub repository Q. It can be used in ed¬ 
ucation, scientific research, or as a starting point for implementations in other programming 
languages. The number of available algorithms is growing, let us list some of them; 

• Graph traversal (breadth-first search, depth-first search) 

• Gonnectivity (connected components, strongly connected components) 

• Accessibility (transitive closure) 

• Topological sorting 

• Gycle detection 

• Testing bipartiteness 

• Minimum spanning tree 

• Matching (Augmenting path algorithm, Hopcroft-Karp algorithm) 

• Shortest path search 
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• Maximum flow (Ford-Fulkerson algorithm, Edmonds-Karp algorithm) 

• Graph generators 
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